(57) Abstract

A heterologous protein expression system comprising a heterologous
gene DNA, such as EPO genomic DNA, a vector receiving the DNA, and an
avian cell, such as a duck embryo cell or a quail fibrosarcoma cell
line, expressing the gene in the vector can be used to efficiently
produce heterologous proteins such as EPO.
HETEROLOGOUS PROTEIN PRODUCTION SYSTEM USING AVIAN CELLS

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to novel expression systems that
can produce biomedically important heterologous proteins, including
human erythropoietin (hereafter "EPO"), and more specifically to the
production of various heterologous proteins by transfecting DNA
encoding the proteins, such as the genomic DNA encoding EPO, into
avian cells.

2. Related Art

Many recombinant proteins used in medicine are relatively small
and simple in structure, and biologically functional forms can be
produced in prokaryotes such as E. coli. However, some human
proteins of medical interest, such as TPA (tissue plasminogen
activator), Factor VIII and EPO, are more complicated because
their biological function requires post-translational modification. For
example, EPO is extensively glycosylated, with the carbohydrate portion
accounting for 40 % of the molecular mass. It has been shown that
the carbohydrate portion of EPO is important for biological function.
Accordingly, EPO produced in E. coli, yeast or insect cells is inactive
or very weakly active in vivo, while EPO produced in COS or CHO cells
was found to be fully active. For this reason, heterologous proteins
of this kind have been produced only in mammalian cells.



In the meantime, the avian system has long been used for the study
of gene expression in higher eukaryotes. One of the first viruses to
be linked to tumors was the Rous sarcoma virus (RSV) of chicken,
and this virus was instrumental in demonstrating that a retroviral
oncogene can originate from a cellular gene, leading to the concept of
the proto-oncogene. Studies of gene expression have also been done
using the RSV LTR promoter, which has often been used for high-level
expression of heterologous genes in mammalian cells. In addition,
avian embryo cells have been used extensively in studies of various
animal viruses.

SUMMARY OF THE INVENTION 

The present invention is directed to the high-level expression
of eukaryotic heterologous proteins. It is an object of the present
invention to provide a novel heterologous gene expression system
which can produce proteins of higher eukaryotic cells. It is another
object to provide a method of efficiently producing higher eukaryotic
proteins, such as EPO, which have been known to be active only
when they are produced in a mammalian cell. It is a further object of
the invention to provide a method of producing EPO in particular
among the eukaryotic proteins described above.

To accomplish these objects, the present invention provides a
heterologous gene expression system comprising a DNA encoding a
heterologous protein, a vector for receiving the DNA, and an avian
cell for harboring the vector.

The present invention also provides a method of producing a
heterologous protein comprising the steps of culturing the expression
system of claim 1 in media to express the heterologous gene, and
purifying the heterologous protein from the cells and the media.

Preferably, the heterologous protein of the present invention is
selected from the group consisting of those proteins that are known to
be active only when expressed in mammalian cells (such as EPO, TPA
and Factor VIII). Preferably, the vector contains a promoter
selected from the group consisting of the SV40 early promoter, the major
immediate early promoter of human cytomegalovirus (hereafter "HCMV
MIEP") and the RSV LTR. Preferably, the avian cell is selected from
the group consisting of duck embryo cell (hereafter "DE"), chicken
embryo fibroblast (hereafter "CEF") and quail fibrosarcoma (hereafter
"QT"), more preferably QT-VC, which was isolated by the inventors.
QT-VC was deposited with the International Depository Authority, Korea
Research Institute of Bioscience and Biotechnology, Korean Collection
for Type Culture, and assigned deposit number KCTC 0277BP on
August 22, 1996. The deposited QT-VC was transfected with the
expression vector containing SY-EPO cDNA as described in Fig. 8.

More preferably, the DNA encoding the heterologous protein is
genomic DNA or cDNA.

Further, the present invention provides an EPO production
system comprising a DNA encoding EPO, a vector for receiving the
DNA, and an avian cell for harboring the vector.

Moreover, the invention provides a method of producing EPO
comprising the steps of inserting a DNA encoding EPO into a vector,
transfecting the vector into an avian cell, and culturing the transfected
avian cell in media.

Preferably, the avian cell of the EPO production system is DE or
QT, and the DNA is a genomic DNA encoding EPO, more preferably
a DNA selected from the group consisting of SY, JM, SH and HE
described in Fig. 5.

Preferably, the vector has a promoter selected from the group
consisting of the SV40 early promoter, the HCMV MIEP and the RSV LTR.

The present invention also provides an avian cell as a host for
expressing genes encoding mammalian proteins.

Further, the present invention provides a novel EPO genomic
sequence selected from the group consisting of SY, JM, SH and HE
described in Fig. 5, and also provides a novel EPO amino acid
sequence selected from the group consisting of JM, SH and HE
described in Fig. 6.

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 shows the expression of the bacterial CAT gene in avian
cells. DE and CEF cells were transfected with pRc/CMV containing
(+) or lacking (-) the CAT sequence. CAT activity was measured by
determining the amount of acetylated chloramphenicol (AC) produced
from 14C-chloramphenicol. The values shown are from one
representative of more than five independent assays. For this
particular experiment, 10 µg of protein was reacted with
14C-chloramphenicol for 20 min at 37 °C.

Fig. 2 shows the comparison of CAT gene expression between
various cell types and between different promoters. The three
promoter-CAT fusion constructs were transfected into DE, CEF,
CHO-K1, and HeLa cells, and CAT activity was measured as described in
Fig. 1. S, SV40 early promoter; C, HCMV MIEP; R, RSV LTR. The
values shown are from one representative of three independent
assays. For this particular experiment, 10 µg of protein was reacted
with 14C-chloramphenicol for 30 min at 37 °C.

Fig. 3 shows the efficiency of DNA transfection in various cells.
The pCMV-lacZ construct was transfected into DE, CHO, Vero, HeLa,
and 293T cells by calcium phosphate-DNA coprecipitation using the
conditions used for the experiments shown in Fig. 2. Two days after
transfection, cells were fixed and stained with X-gal. The number of
blue cells per 60 mm tissue culture plate was counted. The total
number of cells was comparable between plates at 1-3 X 10^.
Transfection efficiency was calculated relative to DE cells.

Fig. 4 shows a schematic diagram of the cloning of human EPO
and the construction of expression vectors. The five blocks represent
the five coding regions of EPO. The first PCR was performed using
primers 25 and 33. The amplified DNA fragment was cloned and
subjected to a second PCR using primers 12 and 9. The wavy tail in
primer 12 contains the nucleotide sequence from the first coding region.
Therefore, the second PCR generates the entire coding sequence of
EPO, with the first and the second coding regions joined without
the intron between them. Primers 12 and 9 contain HindIII
linkers at their 5' ends, enabling cloning of the EPO genomic sequence
into various expression vectors.

Fig. 5 shows various EPO genomic DNA sequences. SY, SH, HE
and JM are the EPO genomic DNA sequences cloned in the present
invention, and AM and GI are EPO genomic sequences which have
already been reported. Since the intron between the first coding
region and the second coding region was deleted during the cloning,
the deleted intron is not shown in Fig. 5.

Fig. 6 shows various EPO amino acid sequences. SY, SH, HE and
JM are the EPO amino acid sequences cloned in the present invention,
and AM and GI are EPO amino acid sequences which have
already been reported. The abbreviations of the amino acids are as
follows:

A: alanine        R: arginine    N: asparagine     D: aspartic acid
C: cysteine       Q: glutamine   E: glutamic acid  H: histidine
I: isoleucine     L: leucine     K: lysine         M: methionine
F: phenylalanine  P: proline     S: serine
T: threonine      W: tryptophan  Y: tyrosine       V: valine

Fig. 7 shows the comparison of CAT gene expression between
QT-VC and mammalian cell lines. pCMV-CAT was transfected
into QT-VC, CHO-K1, and Vero cells, and CAT activity was measured as
described in Fig. 1. The transfection efficiency, as measured by X-gal
staining following cotransfection with pCMV-lacZ, was reproducibly
3-5 % in all cases. For this particular experiment, 50 µg of protein was
incubated with 14C-chloramphenicol for one hour at 37 °C.

Fig. 8 shows the typical structure of a plasmid used to express EPO
in QT cells. Two types of BamHI cassettes which could express
the gene for human glutamine synthetase (GS) were made. In these
BamHI cassettes, the GS cDNA sequence was flanked by the poly A
sequence from the bovine growth hormone gene and one of two
promoters, the partial MMTV LTR (from -220 to +15 from the RNA start
site) or the 220 bp HSV tk promoter. The BamHI fragment expressing
GS was inserted into the BamHI site of pCI-neo (Promega, Madison,
WI, USA), resulting in the pIGA series of vectors. The HindIII fragment
of the SY-EPO cDNA sequence was cloned into the SmaI site of pIGA,
generating the EPO expression vector, pIGA-EPO.

Fig. 9 shows the production of EPO by QT-N4D4. QT-N4D4
cells were grown to confluence in a 10 cm culture dish (day 0) in M-199
containing 10 % FBS and 1 mM MSX. On day 3, the EPO level was
measured. The cells were then split 1:3 and seeded onto 10 cm
dishes. On day 6, the cells again reached confluence, and the
medium was replaced with 10 ml fresh medium containing 2 % (#) or
10 % (O) FBS. EPO levels were determined by ELISA (R&D Systems,
Minnesota, USA).

Fig. 10 shows the comparison of EPO concentration in DE (#)
and QT-N4D4 (O) measured by ELISA and by in vitro bioassay.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The inventors have explored the possibility of using avian cells
as host cells for heterologous gene expression. We chose to
use three avian cells: two embryonic cells, from chicken and duck, and a
quail fibrosarcoma line. We chose the chicken and duck
embryo cells for the following reasons. First, these embryonic
cells can easily be prepared from eggs, and they divide rapidly,
undergoing many passages. Second, chicken and duck cells can be
grown at large scale at relatively low cost. Third, some avian cells,
such as those from chicken embryos, have already been used for
medical products. For example, influenza virus has been cultured in
chicken eggs for the production of vaccines. Finally, the culture
conditions, including media and temperature, required by avian embryo
cells are virtually identical to those of mammalian cells, suggesting that
the physiology of avian and mammalian cells is probably comparable.

Further, the QT cell line was chosen because various
transformed cell lines have already been developed, these cell lines
are easy to handle when constructing a permanent cell line expressing a
heterologous protein, and the culture conditions and media are similar
to those of mammalian cells.

I. Cells and Plasmids 

1. Cells

The following Table 1 shows the cells used in the experiments.

Table 1

Cells                                            Source

HeLa human cervical carcinoma cells              ATCC CCL2
Vero African green monkey kidney cells           ATCC CCL81
COS-7 African green monkey kidney cells          ATCC CRL1651
  transformed by wild-type T antigen of SV40
CHO-K1 Chinese hamster ovary cells               ATCC CCL61
NIH3T3 contact-inhibited Swiss mouse             ATCC CRL1651
  embryo cells
Ad-5 transformed human embryonic                 ATCC CRL1651
  kidney cells 293
SL-29 chicken embryo fibroblast cells            ATCC CRL1590
Duck embryo                                      ATCC CCL141 or prepared
                                                 by the inventors
Quail fibrosarcoma line QT6                      ATCC CRL1708
Quail fibrosarcoma line QT-VC                    Isolated by the inventors,
                                                 KCTC 0277BP

All these cells except the QT cell lines were grown in Dulbecco's
modified Eagle's medium (DMEM) supplemented with 10 % fetal bovine
serum (FBS). QT cell lines were cultured in M199 medium instead of
DMEM. Duck embryo cells were either obtained from ATCC (CCL 141) or
prepared by trypsinization of 10- to 13-day-old decapitated duck
embryos. These avian cells were grown in minimum essential medium
(Eagle) supplemented with non-essential amino acids and Earle's
balanced salt solution containing 10 % FBS. These cells could be
maintained for approximately another 30 passages. Each medium
used in this study was supplemented with 120 µg/ml penicillin G
(Sigma P-3032; 1690 units per mg) and 200 µg/ml streptomycin (Sigma
S-9137; 750 units per mg).

2. Plasmids

To evaluate the efficiency of heterologous protein production in
avian cells, pRc/RSV-CAT and pRc/CMV-CAT were constructed by
inserting a HindIII CAT cassette (Pharmacia, Piscataway, NJ) into the
HindIII sites of pRc/RSV and pRc/CMV (Invitrogen, San Diego,
California, USA), respectively. For pSVCAT, the plasmid p918 was
used, which has already been described by the inventors. For EPO
expression, three vectors were used. pCMV-gEPO was
constructed by cloning the HindIII fragment of the EPO genomic
sequence into the HindIII site of pRc/CMV. pSV-gEPO was derived by
replacing the CAT sequence of pSV918 with the genomic EPO
sequence. pIGA-EPO carries the EPO cDNA under the control of the
HCMV MIEP, together with the NEO and glutamine synthetase
(hereafter "GS") genes. To measure the transfection efficiency, the
plasmid pCMV-lacZ was constructed by inserting a bacterial lacZ
fragment into the HindIII site of pRc/CMV.

II. DNA Transfection and Gene Expression Assays 

The inventors tested whether avian embryo cells could be used
instead of mammalian cells for high-level heterologous gene
expression. Although avian embryo cells have been used to culture
viruses, there was no report that heterologous proteins of higher
eukaryotic cells had been expressed in these cells. To express
heterologous genes in avian cells, it was first necessary to develop
an efficient technique for transfecting DNA into the target cells. We
could not find any reports on DNA transfection of avian embryo cells.
Accordingly, the inventors have developed a technique by which CEF and
DE cells can readily be transfected with DNA.

Among the techniques available, we chose a method
using calcium phosphate coprecipitation, because this works well for
various adherent cells and can also be used for establishing permanent
lines. We tested many different conditions and found that the
following procedure was optimal.

When cultures were 50-70 % confluent in a 100 mm culture dish,
a total of 10 µg DNA in HBS buffer (140 mM NaCl, 5 mM KCl, 0.75 mM
Na2HPO4.2H2O, 6 mM dextrose, 25 mM HEPES) was incubated with the
cells for 30 min at room temperature. Then 10 ml of regular medium
containing FBS was added, and the cells were incubated for 20 hours
at 37 °C, except for CHO-K1 (8 hours). Cells were then treated with
10 ml of 100 µM chloroquine and incubated for another 3 hours at
37 °C. After replacement with 10 ml of fresh medium, the cells were
grown for 1 to 2 days. Culture supernatants were collected and
centrifuged at 1000 rpm for 10 min to remove cells and debris. To
measure transfection efficiency, cells were transfected with
pCMV-lacZ, rinsed once with PBS 3 days after transfection, fixed with
0.5 % glutaraldehyde (in PBS) for 10 min, and washed twice for 2-10
min each with 4 ml PBS containing 1 mM MgCl2. For X-gal staining, the
staining solution [PBS containing 4 mM K3Fe(CN)6, 4 mM
K4Fe(CN)6.3H2O, 2 mM MgCl2, and 400 µg per ml X-gal (in
dimethylformamide)] was added to the fixed cells, which were incubated
at 37 °C for 4 hours to overnight. When the reaction was
completed, cells were washed once with PBS. Stained cells were kept
in PBS.
The CAT assay was carried out as follows:

Two to three days after transfection, cells were harvested,
washed once with PBS, and resuspended in 0.25 M Tris-HCl (pH 7.5).
Total proteins were prepared by 4 cycles of freezing and thawing
followed by heating at 65 °C for 7 min. Equivalent amounts of protein
were assayed for CAT activity at 37 °C for 30 min. The amount of
protein and the reaction time varied, depending on the experiment. For
example, the CAT activity of cell extracts prepared from DE cells was
so high that only 10 µg of protein and a 20 to 30 min reaction time
could be used, and under this condition, levels of CAT activity in the
mammalian cells were very low or undetectable. When CAT activity
became detectable in the other cells, virtually all of the
14C-chloramphenicol in the DE samples was converted. The percent
conversion of 14C-chloramphenicol to its
acetylated forms was determined by cutting out regions containing
unreacted and acetylated forms and quantifying the amount of
radioactivity in each by liquid scintillation counting.
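
The percent conversion described above can be sketched as a short calculation. This is only an illustrative sketch: the function names and the example counts are hypothetical and do not come from the experiments, and the 50 % threshold is taken from the "limiting conditions" mentioned in section IV.

```python
def percent_conversion(acetylated_cpm, unreacted_cpm):
    """Percent of 14C-chloramphenicol converted to acetylated forms.

    acetylated_cpm: scintillation counts for each acetylated spot
    unreacted_cpm:  scintillation counts for the unreacted spot
    """
    acetylated_total = sum(acetylated_cpm)
    total = acetylated_total + unreacted_cpm
    if total <= 0:
        raise ValueError("no radioactivity counted")
    return 100.0 * acetylated_total / total


def is_limiting(conversion_pct, limit_pct=50.0):
    """True when conversion is below the limit, so samples stay comparable."""
    return conversion_pct < limit_pct


# Hypothetical counts, for illustration only.
example = percent_conversion([1200, 300], 4500)
print(example, is_limiting(example))
```

Extracts that convert nearly all of the substrate would need to be diluted or incubated for a shorter time before their activities can be compared.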

III. Gene Expression in DE and CEF 

The gene expression efficiency of DE and CEF cells was measured using
the CAT gene. We initially chose to use a promoter from the major
immediate-early region of HCMV, because this has been shown to
drive high-level gene expression in many different cell types. In the
plasmid pCMV-CAT, the bacterial CAT gene is placed under the control
of the HCMV MIEP. As a negative control, the plasmid pRc/CMV,
containing the promoter but no CAT sequence, was used. These
plasmids were transfected into DE and CEF cells, and CAT activity was
measured to estimate the efficiency of transfection and gene
expression. One representative result from several independent
transfections is shown in Fig. 1. Transfection of the control plasmid
resulted in undetectable levels of CAT activity in both cells. However,
transfection with pCMV-CAT resulted in readily detectable levels of
CAT activity in both cells. In more than five independent transfection
assays, the level of CAT activity was always higher in DE cells than in
CEF cells. The difference in the level of CAT activity
between the two cells ranged from 10- to 50-fold, depending on the
experiment. This result indicated that avian cells were readily
transfected with DNA and that heterologous genes could be efficiently
expressed.

IV. Comparison of Levels of Gene Expression between Avian 
and Mammalian Cells, and between Different Promoters 

We compared the levels of gene expression between avian
and mammalian cells, using three different promoters:

(1) the SV40 early promoter, which is used during the early
transcriptional phase of SV40 infection;

(2) the HCMV MIEP, which drives the expression of the IE1 and IE2
regulatory proteins immediately after HCMV infection;

(3) the RSV LTR from an avian retrovirus.

These promoters are known to be powerful in mammalian cells,
and have often been used for high-level heterologous gene expression.

These promoter-CAT fusion constructs were transfected into
four different cell types, DE, CEF, CHO-K1, and HeLa, and CAT activity
was measured to compare the efficiency of gene expression between
promoters and between cell types. To make this comparison
semi-quantitative, all transfections and CAT assays were performed at
the same time and using identical conditions. One representative
result of such experiments is shown in Fig. 2. Here, 10 µg of cell
extract was incubated for 30 min in the CAT reaction. Under these
particular conditions, the levels of CAT expression driven from the three
promoters were very low in CHO and HeLa cells (Fig. 2). CAT activity
was detected only when larger amounts of protein were used with an
extended reaction time. In contrast, CAT activity was readily detectable
in the avian cells (Fig. 2), except for the SV40 promoter in CEF cells.
This indicated that expression in avian cells is more efficient than
that in mammalian cells.

The most dramatic finding was that the HCMV MIEP was
extremely powerful in DE cells. In Fig. 2, the conditions used for the
CAT reaction were chosen to generate reasonable levels of CAT
activity in the other samples. When the CAT reaction was performed
under limiting conditions for the protein sample prepared from DE cells
transfected with pCMV-CAT (i.e., when the CAT conversion was below
50 %), the levels of CAT activity of all the other samples were virtually
undetectable. Therefore, the difference in CAT activity
between the protein sample from DE cells transfected with pCMV-CAT
and those from the other transfections is at least two orders of
magnitude. These results suggested that heterologous genes could
be expressed very efficiently under the control of the HCMV MIEP in
DE cells.

It is possible that the high levels of CAT expression seen in DE
cells could be due to efficient transfection of the cell population, rather
than an ability of these cells to support strong gene expression. To
distinguish between these possibilities, we transfected pCMV-lacZ into
DE and various animal cells. After transfection, cells were stained
with X-gal, and the number of blue cells was counted to estimate the
transfection efficiency. As shown in Fig. 3, the number of stained
cells was always comparable between DE and the other animal cells,
suggesting that the high levels of CAT expression in DE cells were due
to high levels of expression in individual cells.

V. Cloning of Human Erythropoietin

To test whether DE cells could indeed be used for the
expression of medically important human proteins, we isolated the
genomic DNA encoding human EPO. We chose to use EPO
as a model because it is a secreted protein, so we could test whether
DE cells properly process secreted proteins. We also used a genomic
clone of EPO instead of the cDNA, to assess whether human genes are
properly spliced to produce functional mRNAs in DE cells.

DNAs for cloning of EPO were prepared from blood cells
collected from four people. Human peripheral blood lymphocytes
were isolated by Ficoll-Hypaque gradient centrifugation of
heparin-treated blood cells. Total DNA was prepared and used for
the polymerase chain reaction with specific oligonucleotide primers
(Fig. 4). The region around the start codon was highly GC rich, so the
EPO sequence was cloned by two steps of PCR using two different pairs
of primers.

To obtain the genomic DNA for EPO, total DNA was prepared by
lysing human peripheral blood lymphocytes using TES (10 mM Tris-HCl
pH 7.8; 1 mM EDTA; 0.7 % SDS), followed by treatment with
400 µg/ml proteinase K at 50 °C for 1 hour, phenol:chloroform
extraction, and ethanol precipitation. The polymerase chain reaction
(PCR) was performed using 0.1 µg of total genomic DNA and
oligonucleotide primers specific to the EPO gene.

Primer #25 (sense, 5' to 3'): GAAGGTGATAAGCTGATAAGG

Primer #33 (antisense, 5' to 3'): TGTGAGATGGTTAGATGTGA

The samples were amplified through 30 cycles with the
following parameters: denaturation at 92 °C for 1 min, primer annealing
at 55 °C for 1 min, and primer extension at 72 °C for 1 min. The DNA
fragment amplified from this reaction did not contain the first 13
nucleotides in the N-terminal region, so a second PCR was performed
using the following primers (Underlined, HindIII; Outlined, start codon
and stop codon, respectively). The relative positions of these primers
are shown in Fig. 4. Taq DNA polymerase (POSCO Chem, Korea)
and Pfu polymerase (Stratagene, California, USA) were used to
amplify DNA.

Primer #12 (sense, 5' to 3'):

CAAGCTTCGGAGATGGGGTGCACGAATGTCCTGCCTGGCTGTGGC

Primer #9 (antisense, 5' to 3'):

CAAGCTTTCATCTGTCCCCTGTCCTGC
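
As a quick consistency check on the two primers above, the following sketch locates the HindIII linker (AAGCTT) in each primer, the start codon in primer #12, and the stop codon that primer #9 encodes on the sense strand. The helper names are hypothetical and are not part of the original work; only the primer sequences and the HindIII recognition site are taken as given.

```python
HINDIII_SITE = "AAGCTT"  # HindIII recognition sequence
PRIMER_12 = "CAAGCTTCGGAGATGGGGTGCACGAATGTCCTGCCTGGCTGTGGC"  # sense
PRIMER_9 = "CAAGCTTTCATCTGTCCCCTGTCCTGC"  # antisense
STOP_CODONS = ("TAA", "TAG", "TGA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def linker_position(primer):
    """0-based index of the HindIII site in the primer, or -1 if absent."""
    return primer.find(HINDIII_SITE)


def stop_codon_before_linker(antisense_primer):
    """Codon immediately upstream of the HindIII site on the sense strand."""
    sense = reverse_complement(antisense_primer)
    pos = sense.find(HINDIII_SITE)
    return sense[pos - 3:pos]


print(linker_position(PRIMER_12), PRIMER_12.find("ATG"))
print(linker_position(PRIMER_9), stop_codon_before_linker(PRIMER_9))
```

Read on the sense strand, primer #9 places a stop codon directly before the HindIII linker, which matches the description of the second PCR generating the complete coding sequence flanked by HindIII sites.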

The amplified DNA from the second PCR was cloned into
pCRII (Invitrogen), from which the HindIII fragment containing the
genomic sequence of EPO was inserted into various expression
vectors as described above. In this experiment, the amplified DNA
was placed under the control of the HCMV MIEP or the SV40 early
promoter, generating pCMV-gEPO and pSV-gEPO, respectively.
SY-EPO, whose amino acid sequence is identical to that of the already
known EPO, was used for the expression experiments in sections VII
and VIII (see section VI).

VI. Analysis of Nucleotide Sequences of Cloned EPO Genomes

The genomic structure of EPO cloned by the above method is different
from that of the natural EPO gene in vivo. Wild-type EPO genomic DNA
has five coding regions and four introns between them. However, in
the DNA cloned by the above method, the first coding region was fused
to the second coding region to form one coding region, so that it has
four coding regions and three introns (Fig. 4).

Analysis of the EPO gene sequences isolated from
four people showed that the nucleotide sequences of the cloned EPO
genes are significantly different, within the introns, from those of
the two previously reported EPO genes (AM-EPO and GI-EPO) (Fig. 5).
This difference was not due to errors introduced during DNA
amplification in the process of cloning. We repeated the cloning and
sequencing using DNAs prepared from the same individuals (but at
different times) and obtained the same nucleotide sequences. As
another control, we amplified the already cloned EPO under similar
conditions and determined the nucleotide sequence. Again, we
obtained the same nucleotide sequence.

The amino acid sequences of the four EPO genes, together with AM and
GI, are shown in Fig. 6. The amino acid sequences of AM, GI and SY are
identical. However, the amino acid sequences from three people (JM, SH,
HE) differ by two or three amino acids from GI- and AM-EPO,
suggesting that there are polymorphisms among people. When
compared with AM- or GI-EPO, HE-EPO had three different amino
acids at the C-terminus, SH-EPO had three different amino acids spread
over the whole polypeptide, and JM-EPO had two different amino acids,
one at the C-terminus and the other in the middle of the polypeptide
(see Fig. 6). For example,
while AM-EPO and GI-EPO had serine, alanine, and valine at positions
36, 100 and 170, respectively, SH-EPO had arginine, serine, and
tyrosine. Further, while AM-EPO and GI-EPO had valine, lysine, and
arginine at positions 170, 177, and 191, HE-EPO had tyrosine,
glutamine, and glycine. In JM-EPO, lysine and tyrosine were present
at positions 54 and 170, where AM-EPO and GI-EPO had threonine and
valine. These results suggested that the EPO gene is polymorphic in
its amino acid sequence as well as in its DNA sequence.
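
The substitutions listed above can be tabulated in a few lines. This is a minimal sketch that encodes only the positions and residues stated in the text; the compact notation (for example S36R, meaning serine at position 36 replaced by arginine) and the helper names are conventions introduced here for illustration.

```python
# Residues of AM-EPO and GI-EPO at the positions mentioned in the text.
REFERENCE = {36: "S", 54: "T", 100: "A", 170: "V", 177: "K", 191: "R"}

# Residues reported for each clone at the positions where it differs.
VARIANTS = {
    "SH": {36: "R", 100: "S", 170: "Y"},
    "HE": {170: "Y", 177: "Q", 191: "G"},
    "JM": {54: "K", 170: "Y"},
}


def substitutions(clone):
    """List substitutions for a clone in reference-position-variant form."""
    return [f"{REFERENCE[pos]}{pos}{aa}"
            for pos, aa in sorted(VARIANTS[clone].items())]


def shared_positions():
    """Positions that differ from the reference in every listed clone."""
    position_sets = [set(changes) for changes in VARIANTS.values()]
    return set.intersection(*position_sets)


for name in VARIANTS:
    print(name, substitutions(name))
print(sorted(shared_positions()))
```

Tabulated this way, the valine-to-tyrosine change at position 170 is the one difference common to SH-, HE- and JM-EPO, while SY-EPO matches the reference at every listed position.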

VII. Expression of EPO in DE Cells

In this experiment, we compared levels of EPO expression
between DE cells and other cell lines.

EPO expression vectors were transfected into various cells
including DE, CEF, CHO, HeLa, VERO, and 293T. We included
VERO cells because they are often used for heterologous gene
expression, and 293T cells because they drive very high levels of gene
expression, presumably due to both the high frequency of DNA
transfection and the presence of potent viral transactivators such as
E1A, E1B, and large T antigen. Two to three days after transfection,
levels of EPO in the culture supernatants were measured by
enzyme-linked immunosorbent assay, and transfection efficiencies
were determined by staining the cells adhering to the culture dish
with X-gal. Transfection efficiency was determined by cotransfecting
a lacZ expression vector together with the EPO expression vector as
described in section II. One representative result of this analysis is
summarized in Table 2.



Table 2

Cell      HCMV MIEP    SV40 early promoter    HCMV/SV40

293       314          17.5                   18
CHO       139.4        10.4                   13.5
VERO      250          10.7                   23.5
NIH3T3    89           79.4                   1.1
DE        4335         13.8                   314.8

When the SV40 early promoter was used, there was little
difference in the levels of EPO between most cell types. However, when
the HCMV MIEP was used, DE cells produced much higher levels of
EPO than any other cell line tested. The HCMV MIEP was much
more active than the SV40 early promoter in almost all the cells tested.
This difference was especially pronounced in DE cells, where the
former produced 315 times more EPO than the latter. Among the
various cell types, DE cells always produced the highest level of EPO.
CHO cells are the source of the cell lines producing the EPO that is
currently used for human application. In this transient system,
however, the level of EPO in CHO cells was at least 30-fold lower than
in DE cells. The difference between DE and 293T cells was also
considerable. The transfection efficiency of 293T was about 30-fold
higher than that of any other cells, including DE cells. Moreover,
293T cells produce potent viral transcription transactivators.
Nevertheless, DE produced 10-fold more EPO than 293T, suggesting that
DE cells could drive high levels of gene expression.
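
The fold differences quoted above can be recomputed from the values in Table 2. This is a minimal sketch: the dictionary simply copies the two measured columns of the table (the table does not state units), and the function names are hypothetical. Ratios recomputed this way may differ slightly from the rounded HCMV/SV40 column.

```python
# EPO levels from Table 2 as (HCMV MIEP, SV40 early promoter).
EPO_LEVELS = {
    "293": (314.0, 17.5),
    "CHO": (139.4, 10.4),
    "VERO": (250.0, 10.7),
    "NIH3T3": (89.0, 79.4),
    "DE": (4335.0, 13.8),
}


def promoter_ratio(cell):
    """EPO level with the HCMV MIEP divided by that with the SV40 promoter."""
    hcmv, sv40 = EPO_LEVELS[cell]
    return hcmv / sv40


def fold_over(cell, reference="DE"):
    """How many times more EPO the reference cell made with the HCMV MIEP."""
    return EPO_LEVELS[reference][0] / EPO_LEVELS[cell][0]


for cell in EPO_LEVELS:
    print(cell, round(promoter_ratio(cell), 1), round(fold_over(cell), 1))
```

With these inputs, DE cells give an HCMV-to-SV40 ratio of roughly 314 and exceed CHO and 293 cells by roughly 31-fold and 14-fold, in line with the figures given in the text.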

In conclusion, human EPO could be efficiently produced and
secreted in DE cells, and the HCMV MIEP is the promoter of choice
for driving high-level heterologous gene expression in DE cells.

In summary, we found that DE cells could produce very high
levels of bacterial and human proteins. All three promoters tested
drove higher levels of gene expression in DE cells than in any other
cell line used in this study. In particular, the HCMV MIEP was extremely
powerful in DE cells. The high level of heterologous gene expression
observed was not due to a higher number of transfected cells. It
appears that DE cells properly carry out splicing and secretion, because
transfection of DE cells with an expression vector containing the EPO
genomic DNA sequence produced a large quantity of EPO in the
culture supernatant.
For DE cells to be used for industrial purposes, one would need to
develop large-scale culture techniques for these cells. There are two
possible ways. First, it may be possible to use the primary cells
themselves as the producer line. A large number of DE cells can
easily be prepared from 10- to 13-day-old duck embryos. From one
embryo, we can readily obtain 10^8 to 10^10 cells that can undergo at
least 15 passages. Therefore, it is possible to transfect DE cells at the
earliest possible stage with an expression vector, followed by selection
of transfected cells, which might require 4-7 passages. Even if less
than 5 % of the cells were transfected, a large number of transfected
cells would be available, suggesting that large-scale production with
primary duck embryo cells is feasible. Second, it should
be possible to transform duck embryo cells at an early stage, using one
of the large number of well-characterized oncogenes that are available.
With transformed DE cells, a producer line could be constructed, and
better quality control of protein production could be established. It
remains to be seen whether transformed DE cells will still maintain the
capability for high-level gene expression. Although a number of
biological questions remain to be answered, the potential of these
cells for the production of various proteins warrants further
investigation.

VIII. Heterologous Gene Expression 
in the Transformed Avian Cell Line 

The above experiments demonstrated the great potential of DE
cells as producers of heterologous proteins such as EPO. However,
the DE cells used in the above experiments are primary cells and stop
dividing after 30-40 passages in vitro. Therefore, unless DE cells are
transformed or special techniques are developed as described above, it
is difficult to use these embryonic cells for industrial production of
heterologous proteins.

In the following study, we tested whether a transformed avian
cell line, namely the quail fibrosarcoma line, could be used to produce
EPO. The quail fibrosarcoma line used in this study, QT-VC, was
subcloned from QT6 (ATCC CRL1708). This line was derived from a
methylcholanthrene-induced fibrosarcoma of Japanese quail. QT-VC
differs from its parental line in at least two respects. First, QT-VC
grows faster than the parental line in the M199 medium containing 10%
FBS used in this study. QT-VC divided every 12-24 hours, while
the doubling time of QT6 was 24-36 hours. Second, QT-VC cells
look rounder than QT6 cells, which generally grow in an elongated
form. Like its parental line, QT-VC did not grow well when it was
seeded at a low density. Therefore, for continuous culture, cells had
to be split at a ratio of 1/3 to 1/2 after reaching confluence.

1. Analysis of Gene Expression in QT-VC Cells

We compared the levels of gene expression between QT-VC
and mammalian cells using pCMV-CAT. We chose to use the HCMV
MIEP as this promoter was shown to drive high levels of gene
expression in various cell types including avian cells (see section
IV). pCMV-CAT was transfected into three cell lines: QT-VC, CHO-K1 and
Vero. To make this comparison semi-quantitative, all transfections
and CAT assays were performed at the same time and under identical
conditions. Transfection efficiency was also measured by
cotransfecting pCM-lacZ followed by X-gal staining. The efficiency
was approximately 3% in all cases. Under these conditions, the
levels of CAT expression in QT-VC cells were always 2-3 times higher
than in the mammalian cell lines used in this study (Fig. 7). Although
the level of gene expression in QT-VC cells appears to be lower than
in DE cells, the quail fibrosarcoma line is at least as good as the
mammalian cell lines, suggesting that it could be used as a producer
of heterologous proteins.

2. Construction of EPO Expression Vectors for QT-VC Cells 

To test whether high levels of heterologous proteins could be
expressed in QT cells, we constructed further EPO expression
vectors. The basic strategy for the construction of an expression
vector was as follows:

First, we chose to use the HCMV MIEP to drive expression of the
heterologous gene, as it had already been shown to be one of the
strongest promoters in avian cells as well as in mammalian cells.

Second, the human glutamine synthetase (GS) gene was used
for amplification of the target gene. Generally, the gene of interest is
amplified to increase the yield of protein by using certain selectable
markers in the presence of specific chemicals. One of the best
examples is the dihydrofolate reductase (DHFR) gene. It has been
shown that the copy number of the heterologous gene and the level of
the respective protein increase as the concentration of methotrexate
(MTX) in the medium is slowly increased. However, this system requires
a host cell defective in the DHFR gene, so it cannot be directly
applied to QT cells, for which such a mutant line is not yet available.
For this reason, we chose to use the GS gene. In this case, the host
cell line need not be deficient in GS, because only multiple copies of
the GS gene can confer resistance to methionine sulfoximine (MSX).

The overall structure of the EPO expression vectors constructed for
use in QT cells is shown in Fig. 8. In this structure, the cDNA
sequence for EPO is under the control of the HCMV MIEP, the bacterial
Neo gene is used as the first selectable marker, and the human GS
gene is also present as the second selectable marker in the same
plasmid. The backbone of the expression vectors used in this particular
experiment was pCI-neo (Promega, USA), which uses the HCMV MIEP
and the intron from the β-globin gene. We have made a couple of
different constructs in which the GS gene is driven by either the
partial MMTV LTR (from -220 to +15) or the 220 bp HSV tk promoter. In
either case, the magnitude of gene amplification appears to be
comparable (data not shown). Detailed procedures and supplementary
data regarding the construction of the expression vectors are available
upon request.

3. Construction of QT-VC Cells Stably Expressing EPO 

To construct QT-VC cell lines constitutively expressing EPO, the
cells were transfected with an EPO expression vector by the calcium
phosphate coprecipitation method as described in section II.
Three days after transfection, EPO production was confirmed by ELISA,
and the transfected cells were treated with G418 (0.8 mg/ml) and MSX
(25 µM). When the G418-resistant cells had grown to confluence, they
were diluted for subcloning. Because QT-VC cells do not grow
efficiently at a low cell density, cells were seeded on 10-cm culture
dishes at various numbers (10^ 10', 10^ 10^ per dish). Then the
colonies that grew distant from other colonies were isolated with
plastic O-rings and expanded onto a 96-well plate. When the cells
reached 70% confluence, the EPO level was measured. Subclones that
produced more than 200 U/ml were serially expanded from 12-well to
6-well to 60 mm culture plates. When the cells reached confluence on a
60 mm dish, they were split onto 6-well plates and then treated with
various concentrations of MSX (100 µM, 250 µM, 1 mM). Using this
procedure, several subclones that produced large amounts of EPO and
also grew fast were selected.
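
The screening rule used above (retain subclones producing more than 200 U/ml, then expose each retained subclone to the three MSX concentrations) can be sketched as follows. This is a minimal illustration in Python; the subclone names and ELISA readings are invented for the example, and the helper functions are hypothetical rather than part of the described procedure.

```python
# Threshold from the text: keep subclones producing more than 200 U/ml.
EPO_THRESHOLD_U_PER_ML = 200.0
# MSX concentrations from the text, in micromolar (1 mM = 1000 uM).
MSX_STEPS_UM = (100, 250, 1000)


def select_subclones(titers_u_per_ml, threshold=EPO_THRESHOLD_U_PER_ML):
    """Return the names of subclones above the threshold, highest titer first."""
    kept = [name for name, titer in titers_u_per_ml.items() if titer > threshold]
    return sorted(kept, key=lambda name: titers_u_per_ml[name], reverse=True)


def plan_msx_treatment(subclones, msx_steps_um=MSX_STEPS_UM):
    """Pair every retained subclone with every MSX concentration to be tested."""
    return [(name, conc) for name in subclones for conc in msx_steps_um]


# Hypothetical ELISA readings (U/ml) for four made-up subclones.
example_titers = {
    "clone_A": 150.0,
    "clone_B": 420.0,
    "clone_C": 200.0,  # exactly 200 U/ml is not "more than 200 U/ml"
    "clone_D": 310.0,
}

selected = select_subclones(example_titers)
for name, msx_um in plan_msx_treatment(selected):
    print(f"{name}: treat with {msx_um} uM MSX")
```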

One of the subclones obtained through this procedure is
QT-N4D4. As shown in Fig. 9, this subclone produced 1200 U/ml when
grown for 3 days after confluence. When the cells were split 1:3,
seeded on 10 cm dishes, and allowed to grow for another 3 days, N4D4
still produced 1000 U/ml. The medium was then replaced with fresh
medium containing 2% FBS, and the cells still produced 400 U/ml EPO.
These results indicated that QT cells could produce a large quantity of
EPO.

In conclusion, the above experiment demonstrated the great 
potential of QT cells as a producer for heterologous protein. 

IX. Biological Activity of EPO Produced in Avian Cells

EPO is heavily glycosylated, and this glycosylation is required
for its biological activity. For example, EPO produced in E. coli or
yeast is inactive or very weakly active in vivo. To test whether EPO
expressed in DE or QT cells was biologically active, we carried out an
in vitro bioassay using spleen cells isolated from mice treated with
phenylhydrazine.

EPO assay: Absolute levels of EPO production after
transfection of various cells were determined by an enzyme-linked
immunosorbent assay that is currently used to measure EPO
levels in human serum (R&D Systems Inc., Minnesota, USA). To
measure the biological activity of EPO, an in vitro bioassay was carried
out by the method of Krystal as modified by Goldberg et al. Spleen cells
were taken from C57BL × C3H F1 hybrid mice (Seoul National
University Laboratory Animal Center) on day 3 after the second of two
daily injections of phenylhydrazine (60 mg/kg of body weight per day),
and spleen cell suspensions were prepared with Lymphoprep™
(NYCOMED PHARMA AS, Oslo, Norway). The spleen cells (final
concentration 4X10^ cells per ml) were then incubated in 24-well tissue
culture plates with various standard doses of EPO (CILAG AG
International, Switzerland; specific activity 2000 U/ml) or unknown
samples for 22 hr and then pulsed with 4 µCi/well tritiated thymidine
(Amersham Co.) for 2-3 hr. The cells were harvested, washed with
PBS several times and lysed with 0.3 N NaOH and 0.1% SDS.
Radioactivity in LSC cocktail solutions was counted with a Pharmacia
Wallac 1410 scintillation counter.
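
Because the bioassay reads unknown samples against standard doses of EPO, the conversion from counts to activity can be illustrated with a simple standard curve. The Python sketch below is an assumption-laden example and not the published Krystal procedure: it assumes that thymidine incorporation rises monotonically with dose and interpolates linearly in log10(dose) between neighbouring standards. The function name and the standard values are hypothetical.

```python
import math


def estimate_dose(cpm, standards):
    """Estimate the EPO dose of an unknown sample from its counts.

    standards -- list of (dose, cpm) pairs for the standard EPO, with
                 positive doses and counts that increase with dose
    cpm       -- counts measured for the unknown sample

    Interpolation is linear in log10(dose) between the two standards
    that bracket the measured counts (an assumption of this sketch).
    """
    if len(standards) < 2:
        raise ValueError("at least two standards are required")
    pts = sorted(standards, key=lambda pair: pair[1])
    if not pts[0][1] <= cpm <= pts[-1][1]:
        raise ValueError("counts fall outside the range of the standards")
    for (d0, c0), (d1, c1) in zip(pts, pts[1:]):
        if c0 <= cpm <= c1:
            if c1 == c0:
                return d0
            frac = (cpm - c0) / (c1 - c0)
            log_dose = math.log10(d0) + frac * (math.log10(d1) - math.log10(d0))
            return 10 ** log_dose
    raise ValueError("no bracketing standards found")


# Hypothetical standard curve: (dose in mU, counts per minute).
example_standards = [(10.0, 1000.0), (100.0, 3000.0), (1000.0, 5000.0)]
print(estimate_dose(2000.0, example_standards))
```

Samples whose counts fall outside the range of the standards are rejected rather than extrapolated; in practice such samples would be diluted and measured again.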

Culture supernatants from QT-N4D4 cells or DE cells transfected
with EPO expression vectors were taken to measure levels of EPO by
both ELISA and the bioassay. The ELISA measures absolute
concentration, and is currently used for determining EPO concentration
in human serum. The bioassay, on the other hand, determines
biological activity using a control EPO that has been produced from
mammalian cells and is currently being used in humans. Fig. 10
compares the levels of EPO determined by these two
methods. The ratio between the values (mU) was 1 ± 0.15, and the
specific activity of EPO produced from DE cells was estimated to be 105
U/µg. Therefore, the levels of EPO measured by ELISA were very
comparable to those obtained by the bioassay. This result suggested
that EPO produced from these avian cells had a biological
activity similar to that of commercially available EPO.
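
A ratio of the kind reported above can be computed from paired measurements as sketched below. This Python example is illustrative only: the direction of the ratio (bioassay value divided by ELISA value) is an assumption, since the text does not state it, and the sample values are invented.

```python
from statistics import mean, stdev


def bioassay_to_elisa_ratio(elisa_mu, bioassay_mu):
    """Return (mean, sample standard deviation) of bioassay/ELISA ratios.

    elisa_mu    -- EPO levels measured by ELISA, in mU
    bioassay_mu -- EPO levels of the same samples by bioassay, in mU
    """
    if len(elisa_mu) != len(bioassay_mu):
        raise ValueError("the two series must have the same length")
    if len(elisa_mu) < 2:
        raise ValueError("at least two paired samples are required")
    if any(value <= 0 for value in elisa_mu):
        raise ValueError("ELISA values must be positive")
    ratios = [bio / elisa for elisa, bio in zip(elisa_mu, bioassay_mu)]
    return mean(ratios), stdev(ratios)


# Hypothetical paired readings (mU) for three supernatant samples.
example_elisa = [100.0, 200.0, 400.0]
example_bioassay = [110.0, 190.0, 400.0]
avg, sd = bioassay_to_elisa_ratio(example_elisa, example_bioassay)
print(f"ratio = {avg:.2f} +/- {sd:.2f}")
```

A mean ratio close to 1 with a small spread is what the comparison in Fig. 10 is described as showing.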

What is claimed is:

1. A heterologous gene expression system comprising:
a DNA encoding a heterologous protein;
a vector for receiving the DNA; and
an avian cell for harboring the vector.

2. The expression system of claim 1, wherein the heterologous
protein is selected from the group consisting of TPA, Factor VIII and
EPO.

3. The expression system of claim 1, wherein the vector
contains a promoter selected from the group consisting of SV40 early
promoter, HCMV MIEP and RSV LTR.

4. The expression system of claim 1, wherein the avian cell is
selected from the group consisting of DE, CEF and QT.

5. The expression system of claim 4, wherein the QT is QT-VC.

6. The expression system of claim 1, wherein the DNA
encoding the heterologous protein is DNA or cDNA.

7. An avian cell as a host for expressing a gene encoding a
mammalian heterologous protein.

8. A method of producing a heterologous protein comprising
the steps of:
culturing the avian cell containing the expression system of claim
1 to express the gene of the heterologous protein in media; and
purifying the heterologous protein from the cell and the media.

9. The method of claim 8, wherein the heterologous protein is
selected from the group consisting of TPA, Factor VIII and EPO.

10. The method of claim 8, wherein the vector contains a
promoter selected from the group consisting of SV40 early promoter,
HCMV MIEP and RSV LTR.

11. The method of claim 8, wherein the avian cell is selected
from the group consisting of DE, CEF and QT.

12. The method of claim 11, wherein the QT is QT-VC.

13. The method of claim 8, wherein the DNA encoding the
heterologous protein is DNA or cDNA.

14. An EPO production system comprising:
a DNA encoding EPO;
a vector for receiving the DNA; and
an avian cell for harboring the vector.

15. The EPO production system of claim 14, wherein the avian
cell is DE or QT.

16. The EPO production system of claim 15, wherein the QT is
QT-VC.

17. The EPO production system of claim 14, wherein the DNA
is a genomic DNA encoding EPO.

18. The EPO production system of claim 14, wherein the DNA
encoding EPO is selected from the group consisting of SY, JM, SH and
HE described in Fig. 5.

19. The production system of claim 14, wherein the vector
contains a promoter selected from the group consisting of SV40 early
promoter, HCMV MIEP and RSV LTR.

20. A method of producing EPO comprising the steps of:
inserting a DNA encoding an EPO into a vector;
transfecting the vector into an avian cell; and
culturing the transfected avian cell in media.

21. The method of claim 20, wherein the avian cell is DE or QT.

22. The method of claim 21, wherein the QT is QT-VC.

23. The method of claim 20, wherein the DNA encoding EPO is
a genomic DNA.

24. The method of claim 20, wherein the DNA encoding the
EPO is selected from the group consisting of SY, JM, SH and HE
described in Fig. 5.

25. The method of claim 20, wherein the vector contains a
promoter selected from the group consisting of SV40 early promoter,
RSV LTR and HCMV MIEP.

26. An EPO genomic sequence selected from the group
consisting of SY, JM, SH and HE described in Fig. 5.

27. An EPO amino acid sequence selected from the group
consisting of JM, SH and HE described in Fig. 6.

FIG. 3

FIG. 4

FIG. 5A

AM ATGGGGGTGCACGAATGTCCTGCCTGGCTGTGGCTTCTCCTGTCCCTGCT 50
GI ATGGGGGTGCACGAATGTCCTGCCTGGCTGTGGCTTCTCCTGTCCCTGCT 50
SY ATGGGGGTGCACGAATGTCCTGCCTGGCTGTGGCTTCTCCTGTCCCTGCT 50
JM ATGGGGGTGCACGAATGTCCTGCCTGGCTGTGGCTTCTCCTGTCCCTGCT 50
SH ATGGGGGTGCACGAATGTCCTGCCTGGCTGTGGCTTCTCCTGTCCCTGCT 50
HE ATGGGGGTGCACGAATGTCCTGCCTGGCTGTGGCTTCTCCTGTCCCTGCT 50

AM GTCGCTCCCTCTGGGCCTCCCAGTCCTGGGCGCCCCACCACGCCTCATCT 100
GI GTCGCTCCCTCTGGGCCTCCCAGTCCTGGGCGCCCCACCACGCCTCATCT 100
SY GTCGCTCCCTCTGGGCCTCCCAGTCCTGGGCGCCCCACCACGCCTCATCT 100
JM GTCGCTCCCTCTGGGCCTCCCAGTCCTGGGCGCCCCACCACGCCTCATCT 100
SH GTCGCTCCCTCTGGGCCTCCCAGTCCTGGGCGCCCCACCACGCCTCATCT 100
HE GTCGCTCCCTCTGGGCCTCCCAGTCCTGGGCGCCCCACCACGCCTCATCT 100

AM GTGACAGCtGAGTCCTGGAGAGGTACCTCTTGGAGGCCAAGGAGGCCGAG 150
GI GTGACAGCCGAGTCCTGGAGAGGTACCTCTTGGAGGCCAAGGAGGCCGAG 150
SY GTGACAGpCGAGTCCTGGAGAGGTACCTCTTGGAGGCCAAGGAGGCCGAG 150
JM GTGACAGCbGAGTCCTGGAGAGGTACCTCTTGGAGGCCAAGGAGGCCGAG 150
SH GTGACAGACGAGTCCTGGAGAGGTACCTCTTGGAGGCCAAGGAGGCCGAG 150
HE GTGACAGfcCGAGTCCTGGAGAGGTACCTCTTGGAGGCCAAGGAGGCCGAG 150

AM AATATCACGGTGAGACCCCTTCCCCAGCACATTCCACAGAACTCACGCTC 200
GI AATATCACGGTGAGACCCCTTCCCCAGCACATTCCACAGAACTCACGCTC 200
SY AATATCACGGTGAGACCCCTTCCCCAGCACATTCCACAGAACTCACGCTC 200
JM AATATCACGGTGAGACCCCTTCCCCAGCACATTCCACAGAACTCACGCTC 200
SH AATATCACGGTGAGACCCCTTCCCCAGCACATTCCACAGAACTCACGCTC 200
HE AATATCACGGTGAGACCCCTTCCCCAGCACATTCCACAGAACTCACGCTC 200

FIG. 5B

AM AGGGCTTCAGGG-AACTCCTCCCAG-ATCCAGGAACCTGGCACTTGGTTT 248
GI AGGGCTTCAGGG-AACTCCTCCCAG-ATCCAGGAACCTGGCACTTGGTTT 248
SY AGGGCTTCAGGG-AACTCCTCCCAG-ATCCAGGAACCTGGCACTTGGTTT 248
JM AGGGCTTCAGGG-AACTCCTCCCAG-ATCCAGGAACCTGGCACTTGGTTT 248
SH AGGGCTTCAGGG-AACTCCTCCCAG-ATCCAGGAACCTGGCACTTGGTTT 248
HE AGGGCTTCAGGGGAACTCCTCCCAGGATCCAGGAACCTGGCACTTGGTTT 250

AM GGGGTGGAGTTGGGAAGCTAGACACTGCCCCCCTACATAAGAATAAGTCT 298
GI GGGGTGGAGTTGGGAAGCTAGACACTGCCCCCCTACATAAGAATAAGTCT 298
SY GGGGTGGAGTTGGGAAGCTAGACACTGCCCCCCTACATAAGAATAAGTCT 298
JM GGGGTGGAGTTGGGAAGCTAGACACTGCCCCCCTACATAAGAATAAGTCT 298
SH GGGGTGGAGTTGGGAAGCTAGACACTGCCCCCCTACATAAGAATAAGTCT 298
HE GGGGTGGAGTTGGGAAGCTAGACACTGCCCCCCTACATAAGAATAAGTCT 300

AM GGTGGCCCCAAACCATACCTGGAAACTAGGCAAGGAGCAAAGCCAGCAGA 348
GI GGTGGCCCCAAACCATACCTGGAAACTAGGCAAGGAGCAAAGCCAGCAGA 348
SY GGTGGCCCCAAACCATACCTGGAAACTAGGCAAGGAGCAAAGCCAGCAGA 348
JM GGTGGCCCCAAACCATACCTGGAAACTAGGCAAGGAGCAAAGCCAGCAGA 348
SH GGTGGCCCCAAACCATACCTGGAAACTAGGCAAGGAGCAAAGCCAGCAGA 348
HE GGTGGCCCCAAACCATACCTGGAAACTAGGCAAGGAGCAAAGCCAGCAGA 350

[Alignment rows ending at AM 397, GI 395, SY 397, JM 398, SH 397 and HE 399 are illegible in the source.]

FIG. 5C

[The alignment rows of this panel are illegible in the source. Row end positions of its four blocks, in the order AM, GI, SY, JM, SH, HE: 447, 445, 447, 448, 447, 449; 497, 495, 497, 498, 497, 499; 547, 545, 547, 548, 545, 549; 597, 595, 597, 598, 595, 599.]

FIG. 5D

[The alignment rows of the first three blocks of this panel are illegible in the source. Row end positions, in the order AM, GI, SY, JM, SH, HE: 647, 645, 647, 648, 645, 649; 697, 695, 697, 698, 695, 699; 747, 745, 747, 748, 745, 749.]

AM TTAAAAAAATTAGTCAGGTGAAGTGGTGCATGGTGGTAGTCCCAGATATT 797
GI TTAAAAAAATTAGTCAGGTGAAGTGGTGCATGGTGGTAGTCCCAGATATT 795
SY TTAAAAAAATTAGTCAGGTGAAGTGGTGCATGGTGGTAGTCCCAGATATT 797
JM TTAAAAAAATTAGTCAGGTGAAGTGGTGCATGGTGGTAGTCCCAGATATT 798
SH TTAAAAAAATTAGTCAGGTGAAGTGGTGCATGGTGGTAGTCCCAGATATT 795
HE TTAAAAAAATTAGTCAGGTGAAGTGGTGCATGGTGGTAGTCCCAGATATT 799

FIG. 5E

[The alignment rows of this panel are illegible in the source. Row end positions of its four blocks, in the order AM, GI, SY, JM, SH, HE: 847, 845, 847, 848, 845, 849; 897, 895, 897, 898, 895, 899; 947, 945, 947, 948, 945, 949; 997, 995, 997, 998, 995, 999.]

FIG. 5F

[The alignment rows of the first three blocks of this panel are illegible in the source. Row end positions, in the order AM, GI, SY, JM, SH, HE: 1047, 1045, 1047, 1048, 1045, 1049; 1097, 1095, 1097, 1098, 1095, 1099; 1147, 1145, 1147, 1148, 1145, 1149.]

AM CCTGGCCCTGCTGTCGGAAppTGTCCTGCGGGGCCAGGCCCTGTTGGTCA 1197
GI CCTGGCCCTGCTGTCGGAAGCTGTCCTGCGGGGCCAGGCCCTGTTGGTCA 1195
SY CCTGGCCCTGCTGTCGGAAGCTGTCCTGCGGGGCCAGGCCCTGTTGGTCA 1197
JM CCTGGCCCTGCTGTCGGAAGCTGTCCTGCGGGGCCAGGCCCTGTTGGTCA 1198
SH CCTGGCCCTGCTGTCGGAATCTGTCCTGCGGGGCCAGGCCCTGTTGGTCA 1195
HE CCTGGCCCTGCTGTCGGAAGCTGTCCTGCGGGGCCAGGCCCTGTTGGTCA 1199

FIG. 5G

[The alignment rows of this panel are illegible in the source. Row end positions of its four blocks, in the order AM, GI, SY, JM, SH, HE: 1247, 1245, 1247, 1248, 1245, 1249; 1297, 1295, 1297, 1298, 1295, 1299; 1347, 1345, 1347, 1348, 1345, 1349; 1397, 1395, 1397, 1398, 1395, 1399.]

FIG. 5H

[The alignment rows of this panel are illegible in the source. Row end positions of its four blocks, in the order AM, GI, SY, JM, SH, HE: 1447, 1445, 1447, 1448, 1445, 1449; 1497, 1495, 1497, 1498, 1495, 1499; 1547, 1545, 1547, 1548, 1545, 1549; 1584, 1582, 1585, 1585, 1583, 1586.]

FIG. 6

AM/GI MGVHECPAWLWLLLSLLSLPLGLPVLGAPPRLICDpjRVLERYLLEAKEAE 50
SY MGVHECPAWLWLLLSLLSLPLGLPVLGAPPRLICDSRVLERYLLEAKEAE 50
JM MGVHECPAWLWLLLSLLSLPLGLPVLGAPPRLICDSRVLERYLLEAKEAE 50
SH MGVHECPAWLWLLLSLLSLPLGLPVLGAPPRLICDRRVLERYLLEAKEAE 50
HE MGVHECPAWLWLLLSLLSLPLGLPVLGAPPRLICOlsbLERYLLEAKEAE 50

[Alignment rows for residues 51-100 are illegible in the source.]

AM/GI VLRGQALLVNSSQPWEPLQLHVDKAVSGLRSLTTLLRALGAQKEAISPPD 150
SY VLRGQALLVNSSQPWEPLQLHVDKAVSGLRSLTTLLRALGAQKEAISPPD 150
JM VLRGQALLVNSSQPWEPLQLHVDKAVSGLRSLTTLLRALGAQKEAISPPD 150
SH VLRGQALLVNSSQPWEPLQLHVDKAVSGLRSLTTLLRALGAQKEAISPPD 150
HE VLRGQALLVNSSQPWEPLQLHVDKAVSGLRSLTTLLRALGAQKEAISPPD 150

[Alignment rows for residues 151-193 are illegible in the source; each of the five rows ends at position 193.]

FIG. 9